Term Weighting Using Term Dependence

نویسندگان

  • Raj Kishor Bisht
  • Garima Srivastava
  • H. S. Dhami
چکیده

Performance of an information retrieval system depends on its weighting scheme. Weighting of a term can be seen in two aspects, local and global. For each type of weighting scheme, generally, single terms are considered. Term dependency is quite natural in a document. Word pairs or phrases can better describe a document in place of single terms. In the present paper an attempt has been made to study and quantify the dependency of terms to each other. Term dependency has been utilized to define a local weighting scheme for word pairs. Utility of the proposed weighting scheme has been shown by arbitrarily choosing some documents and extracting relevant word pairs from the documents.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Augmentation of paired pulse index as short-term plasticity due to morphine dependence

Abstract* Introduction: Chronic morphine exposure can cause addiction and affect synaptic plasticity, but the underlying neural mechanisms of this phenomenon remain unknown. Herein we used electrophysiologic approaches in hippocampal CA1 area to examine the effect of chronic morphine administration on short-term plasticity. Methods: Experiments were carried out on hippocampal slices taken f...

متن کامل

Weighting in Information Retrieval Using Genetic Programming: A Three Stage Process

This paper presents term-weighting schemes that have been evolved using genetic programming in an adhoc Information Retrieval model. We create an entire term-weighting scheme by firstly assuming that term-weighting schemes contain a global part, a term-frequency influence part and a normalisation part. By separating the problem into three distinct phases we reduce the search space and ease the ...

متن کامل

A Note on the Effect of Term Weighting on Selecting Intrinsic Dimensionality of Data

The effect of term weighting on selecting intrinsic dimensionality of data is discussed. Experiments are conducted, using different term weighting and dimensionality selection methods, on four testing document collections (namely Medline, Cranfield, CACM and CISI). The results point that transforming the data matrix using a term weighting scheme plays a vital role in identifying the intrinsic d...

متن کامل

The Effect of Term Importance Degree on Text Retrieval

Various approaches to index term-weighting have been investigated. In fact, term-weighting is an indispensable process for document ranking in most retrieval systems. As well actual information retrieval systems have to deal with explosive growth of documents of various sizes and terms of various frequencies because an appropriate term-weighting scheme has a crucial impact on the overall perfor...

متن کامل

A Novel Term Weighting Scheme Midf for Text Categorization

Text categorization is a task of automatically assigning documents to a set of predefined categories. Usually it involves a document representation method and term weighting scheme. This paper proposes a new term weighting scheme called Modified Inverse Document Frequency (MIDF) to improve the performance of text categorization. The document represented in MIDF is trained using the support vect...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010